Analysis-by-synthesis low-rate multimode harmonic speech coding

Authors

  • Chunyan Li
  • Allen Gersho
  • Vladimir Cuperman
Abstract

This paper presents an analysis-by-synthesis multimode harmonic coder (AbS-MHC) that employs new techniques to improve both the accuracy of the speech model and the robustness of parameter estimation in the low-rate harmonic coding framework. To improve the speech model accuracy, an enhanced frequency-domain transition model is used in conjunction with sinusoidal-model-based harmonic coding of voiced/unvoiced speech signals. To achieve robust parameter estimation, a generalized analysis-by-synthesis parameter estimation scheme in the harmonic coding framework is proposed. This scheme uses a time-scale signal modification technique to allow for waveform matching in harmonic coding. This concept is demonstrated in our AbS-MHC coder with a specific method for efficient closed-loop pitch estimation and speech classification. The speech quality of the unquantized AbS-MHC coder is better than that of G.723 at 6.3 kbps.
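
As a rough illustration of the closed-loop (analysis-by-synthesis) pitch estimation mentioned in the abstract, the sketch below resynthesizes a frame from a harmonic model for each candidate pitch and keeps the candidate whose waveform matches the input best. It is not the authors' algorithm: the time-scale signal modification, speech classification, and quantization steps are omitted, the harmonic amplitudes are obtained by simple projection rather than any method described in the paper, and the names `harmonic_fit` and `closed_loop_pitch`, the 8 kHz sampling rate, the 320-sample frame, and the candidate grid are all assumptions made for the example.

```python
import math


def harmonic_fit(frame, f0, fs, num_harmonics):
    """Project the frame onto harmonics of f0 and return the resynthesized frame.

    Assumption: amplitudes come from a plain cosine/sine projection, which is
    only exact when the harmonics are orthogonal over the frame.
    """
    n_samples = len(frame)
    synth = [0.0] * n_samples
    for k in range(1, num_harmonics + 1):
        w = 2.0 * math.pi * k * f0 / fs
        cos_k = [math.cos(w * n) for n in range(n_samples)]
        sin_k = [math.sin(w * n) for n in range(n_samples)]
        a = 2.0 / n_samples * sum(x * c for x, c in zip(frame, cos_k))
        b = 2.0 / n_samples * sum(x * s for x, s in zip(frame, sin_k))
        for n in range(n_samples):
            synth[n] += a * cos_k[n] + b * sin_k[n]
    return synth


def closed_loop_pitch(frame, fs, candidates, num_harmonics=3):
    """Return the candidate pitch (Hz) with the smallest waveform-matching error."""
    best_f0, best_err = None, float("inf")
    for f0 in candidates:
        synth = harmonic_fit(frame, f0, fs, num_harmonics)
        err = sum((x - y) ** 2 for x, y in zip(frame, synth))
        if err < best_err:
            best_f0, best_err = f0, err
    return best_f0, best_err


# Synthetic voiced frame: three harmonics of 150 Hz, 40 ms at 8 kHz (hypothetical values).
fs = 8000
true_f0 = 150.0
frame = [
    sum(math.cos(2.0 * math.pi * k * true_f0 * n / fs + 0.3 * k) / k for k in (1, 2, 3))
    for n in range(320)
]

f0_hat, err = closed_loop_pitch(frame, fs, range(100, 201, 5))
print(f0_hat, err)
```

On this synthetic frame the search should settle on the 150 Hz candidate, because the frame holds a whole number of pitch periods and the projection is then exact. The candidate grid deliberately excludes 75 Hz; a practical coder would need some safeguard against such sub-multiple pitch errors, which this sketch does not attempt.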

Similar resources

Analysis-by-synthesis multimode harmonic speech coding at 4 kb/s

This paper presents a 4 kb/s Analysis-by-Synthesis Multimode Harmonic Coder (AbS-MHC). Novel features of this coder include a signal modification technique that allows time-domain analysis-by-synthesis parameter estimation in a sinusoidal coding framework, and a frequency-domain transition speech model with improved parameter estimation and quantization schemes. An efficient quantization scheme fo...

Polynomial quasi-harmonic models for speech analysis and synthesis

Harmonic plus noise models have been successfully applied to a broad range of speech processing applications, including, among others, low bit-rate speech coding, and speech restoration and transformation. In conventional methods, the frequencies, the relative phases and the amplitudes of the pitch-harmonic components are assumed to be piecewise constants over an analysis frame. This assumption...

Enhanced harmonic coding of speech with frequency domain transition modelling

A major source of audible distortion in current low-bit-rate harmonic speech coding algorithms is the ineffective modeling of transitional speech signals such as onsets, plosives, etc. A new method of modeling transitional speech based on a frequency domain approach is introduced in this paper. The approach uses a modified harmonic model able to produce non-periodic pulse sequences in conju...

Multimode Tree Coding of Speech with Perceptual Pre-weighting and Post-weighting

A low delay and low complexity speech coder based on Multimode Tree Coding is proposed. In our Multimode Tree Coder, a simple mode classification method along with frame energy are used to classify the input speech frames into five different modes. Each mode is coded at a suitable bit-rate using a Tree coder with computationally efficient perceptual error pre-weighting and post-weighting filter...

Advances in Very Low Bit Rate Speech Coding Using Recognition and Synthesis Techniques

ALISP (Automatic Language Independent Speech Processing) units are an alternative concept to using phoneme-derived units in speech processing. This article describes advances in very low bit rate coding using ALISP units. Results of speaker-independent experiments are reported and speaker clustering using vector quantization is proposed. The improvements of speech re-synthesis using Harmonic Noi...

Publication date: 1999